Compositional Data Analysis
نویسندگان
چکیده
Compositional data are nonnegative carrying relative, rather than absolute, information—these often with a constant-sum constraint on the sample values, for example, proportions or percentages summing to 1% 100%, respectively. Ratios between components of composition important since they unaffected by particular set chosen. Logarithms ratios (logratios) fundamental transformation in ratio approach compositional analysis—all thus need be strictly positive, so that zero values present major problem. Components group together based domain knowledge can amalgamated (i.e., summed) create new components, and this alleviate problem zeros. Once transformed logratios, regular univariate multivariate statistical analysis performed, such as dimension reduction clustering, well modeling. Alternative methodologies come close ideals logratio also considered, especially those avoid zeros, which is particularly acute large bioinformatic sets.
منابع مشابه
Correlation Analysis for Compositional Data
Compositional data need a special treatment prior to correlation analysis. In this paper we argue why standard transformations for compositional data are not suitable for computing correlations, and why the use of raw or log-transformed data is neither meaningful. As a solution, a procedure based on balances is outlined, leading to sensible correlation measures. The construction of the balances...
متن کاملRobust factor analysis for compositional data
Factor analysis as a dimension reduction technique is widely used with compositional data. Using the method for raw data or for improperly transformed data will, however, lead to biased results and consequently to misleading interpretations. Although some procedures, suitable for factor analysis with compositional data, were already developed, they require pre-knowledge of variable groups, or a...
متن کاملLecture Notes on Compositional Data Analysis
Preface These notes have been prepared as support to a short course on compositional data analysis. Their aim is to transmit the basic concepts and skills for simple applications, thus setting the premises for more advanced projects. One should be aware that frequent updates will be required in the near future, as the theory presented here is a field of active research. The notes are based both...
متن کاملClustering compositional data trajectories
This work is motivated by the following question: given a sample of compositional data trajectories (i.e. sequences of composition measurements along a domain), how can one propose a segmentation procedure leading to homogeneous classes? In other words, our contribution aims at studying statistical methods suited for clustering compositional data, when the observations are constituted by trajec...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Annual review of statistics and its application
سال: 2021
ISSN: ['2326-8298', '2326-831X']
DOI: https://doi.org/10.1146/annurev-statistics-042720-124436